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MODIFIED BRYODIN J 



FIELD OF THE INVENTION 

The present invention relates to polypeptides to be administered especially to humans and 
5 in particular for therapeutic use. The polypeptides are modified polypeptides whereby the 
modification results in a reduced propensity for the polypeptide to elicit an immune 
response upon adminis tration to the human subject. The invention in particular relates to 
the modification of bryodin 1 to result in bryodin 1 proteins that are substantially non- 
immunogenic or less immunogenic than any non-modified counterpart when used m vivo. 
10 The invention relates furthermore to T-cell epitope peptides derived from said non- 
modified protein by means of which it is possible to create modified bryodin 1 variants 
with reduced immunogenicity. 

BACKGROUND OF THE INVENTION 

15 There are many instances whereby the efficacy of a therapeutic protein is limited by an 
unwanted immune reaction to the therapeutic protein. Several mouse monoclonal 
antibodies have shown promise as therapies in a number of human disease settings but in 
certain cases have failed due to the induction of significant degrees of a human anti- 
murine antibody (HAMA) response [Schroff, R. W. et al (1985) Cancer Res. £5: 879-885; 

20 Shawler, D.L. et al (1985) J. Immunol. 135_: 1530-1535]. For monoclonal antibodies, a 
number of techniques have been developed in attempt to reduce the HAMA response 
[WO 89/09622; EP 0239400; EP 0438310; WO 91/06667]. These recombinant DNA 
approaches have generally reduced the mouse genetic infonnation in the final antibody 
construct whilst increasing the human genetic information in the final construct. 

25 Notwithstanding, the resultant "humanized" antibodies have, in several cases, still elicited 
an immune response inpatients |Issacs J.D. (1990) Sem. Immunol. 2: 449, 456; Rebello, 
P.R. etal (1999) Transplantation 6jJ: 1417-1420]. 

Antibodies are not the only class of polypeptide molecule administered as a therapeutic 
30 agent against which an immune response may be mounted. Even proteins of human 

origin and with the same amino acid sequences as occur within humans can still induce an 
immune response in humans. Notable examples include the therapeutic use of 
granulocyte-macrophage colony stimulating factor [Wadhwa, M. etal (1999) Clin. 
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Cancer Res, 5: 1353-1361] and interferon alpha 2 [Russo, D. et al (1996) Brl J. Haem. 
94: 300-305; Stein, R. et al (1988) New Engl X Med, 318: 1409-1413] amongst others. 

A principal factor in the induction of an immune response is the presence within the 
5 protein of peptides that can stimulate the activity of T-cells via presentation on MHC 
class II molecules, so-called H T-cell epitopes". Such potential T-cell epitopes are 
commonly defined as any amino acid residue sequence with the ability to bind to MHC 
Class II molecules. Such T-cell epitopes can be measured to establish MHC binding. 
Implicitly, a "T-cell epitope" means an epitope which when bound to MHC molecules can 
10 be recognized by a T-cell receptor (TCR), and which can, at least in principle, cause the 
activation of these T-cells by engaging a TCR to promote a T-cell response. It is, 
however, usually understood that certain peptides which are found to bind to MHC Class 
E molecules may be retained in a protein sequence because such peptides axe recognized 
as "self within the organism into which the final protein is administered, 

15 

It is known, that certain of these T-cell epitope! peptides can be released during the 
degradation of peptides B polypeptides or proteins within cells and subsequently be • 
presented by molecules of the major histocompatability complex (MHC) in order to 
trigger the activation of T-cells. For peptides presented by MHC Class EL, such activation 
20 of T-cells can then give rise, for example, to an antibody response by direct stimulation of 
B-cells to produce such antibodies. 



MHC Class II molecules are a group of highly polymozphic proteins which play a central 
role in helper T-cell selection and activation. The human leukocyte antigen group DR 

25 (HLA-DR) are the predominant isotype of this group of proteins and are the major focus 
of the present invention. However, isotypes HLA-DQ and HLA-DP perform similar 
functions, hence the present invention is equally applicable to these. The MHC class II 
DR molecule is made of an alpha and a beta chain which insert at their C-tennini through 
the cell membrane. Each hetero-dimer possesses a ligand binding domain which binds to 

30 peptides varying between 9 and 20 amino acids in length* although the binding groove 
can accommodate a maximum of 1 1 amino acids. The ligand binding domain is 
comprised of amino acids 1 to 85 of the alpha chain, and amino acids 1 to 94 of the beta 
chain. DQ molecules have recently been shown to have an homologous structure and the 
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DP family proteins are also expected to be very similar. In humans approximately 70 
different allotypes of the DR isotype are known, for DQ there are 30 different allotypes 
and for DP 47 different allotypes are known. Each individual bears two to four DR 
alleles, two DQ and two DP alleles, The structure of a number of DR molecules has been 

5 solved and such structures point to an open-ended peptide binding groove with a number 
of hydrophobic pockets which engage hydrophobic residues (pocket residues) of the 
peptide [Brown et al Nature (1993) 36£ 33; Stem et al (1994) Nature 36& 215], 
Polymorphism identifying the different allotypes of class II molecule contributes to a 
wide diversity of different binding surfaces for peptides within the peptide binding grove 

10 and at the population level ensures maximal flexibility with regard to the ability to 
recognize foreign proteins and mount an immune response to pathogenic organisms. 
There is a considerable amount of polymorphism within the Iigand binding domain with 
distinct "families" within different geographical populations and ethnic groups. This 
polymorphism affects the binding characteristics of the peptide binding domain, thus 

15 different '^families" of DR molecules will have specificities for peptides with different 
sequence properties, although there may be some overlap. This specificity determines 
recognition of Th-cell epitopes (Class II T-cell response) which are ultimately responsible 
for driving the antibody response to B-cell epitopes present on the same protein from 
which the Th-cell epitope is derived. Thus, the immune response to a protein in an 

20 individual is heavily influenced by T-cell epitope recognition which is a function of the 
peptide binding specificity of that individual's HLA-DR allotype. Therefore, in order to 
identify T-cell epitopes within a protein or peptide in the context of a global population, it 
is desirable to consider the binding properties of as diverse a set of HLA-DR allotypes as 
possible, thus covering as high a percentage of the world population as possible. 

25 

An immune response to a therapeutic protein proceeds via the MHC class II peptide 
presentation pathway. Here exogenous proteins are engulfed and processed for 
presentation in association with MHC class II molecules of the DR* DQ or DP type. 
MHC Class II molecules are expressed by professional antigen presenting cells (APCs), 
30 such as macrophages and dendritic cells amongst others. Engagement of a MHC class n 
peptide complex by a cognate T-cell receptor on. the surface of the T-cell, together with 
the cross-binding of certain other co-receptors such as the CD4 molecule, can induce an 
activated state within the T-cell. Activation leads to the release of cytokines further 




Empf anssze i t ll.Juni 14:56 



. 102103P-BZ-0206ll.doc 




4/38 



activating other lymphocytes such, as B cells to produce antibodies or activating T killer 
cells as a foil cellular immune response. 

The ability of a peptide to bind a given MHC class n molecule for presentation on the 
surface of an APC is dependent on a number of factors most notably its primary 
5 sequence. This will influence both its propensity for proteolytic cleavage and also its 
affinity for binding within the peptide binding cleft of the MHC class II molecule. The 
MHC class n / peptide complex on the APC surface presents a binding face to a particular 
T-cell receptor (TCR) able to recognize deteiminants provided both by exposed residues 
of the peptide and the MHC class II molecule. 

10 

In the art there are procedures for identifying synthetic peptides able to bind MHC class II 
molecules (e.g, W098/52976 aad WO00/34317), Such peptides may not fonction as T- 
cell epitopes in all situations 3 particularly, in vivo due to the processing pathways or other 
phenomena. T-cell epitope identification is the first step to epitope elimination. The 
( 15 identification and removal of potential T-cell epitopes from proteins has been previously 
disclosed. In the art methods have been provided to enable the detection of T-cell epitopes 
usually by computational means scanning for recognized sequence motifs in 
experimentally determined T-cell epitopes or alternatively using computational 
techniques to predict MHC class H-binding peptides and in particular DR-binding 
20 peptides. 

W098/52976 and WO00/34317 teach computational threading approaches to identifying 
polypeptide sequences with the potential to bind a sub-set of human MHC class n DR 
allotypes. In these teachings, predicted T-cell epitopes are removed by the use of 
25 judicious amino acid substitution within the primary sequence of the therapeutic antibody 
or non-antibody protein of both non-human and human derivation. 

Other techniques exploiting soluble complexes of recombinant MHC molecules in 
combination with synthetic peptides and able to bind to T-cell clones from peripheral 
30 blood samples from human, or experimental animal subjects have been used in the art 
[Kem, P. et al (1998) Nature Medicine 4:975-978; Kwok, W.W. et al (2001) TRENDS in 
Immunol. 22:583-588]. These and other schemes including for example the use of whole 
proteins or synthetic peptides or variant molecules to the protein of interest may be 
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screened for molecules -with altered ability to bind or stimulate T-cells may equally be 
exploited in an epitope identification strategy. 

As depicted above and as consequence thereof, it would be desirable to identify and to 
5 remove or at least to reduce T-cell epitopes from a given in principal therapeutically 
valuable but originally immunogenic peptide, polypeptide or protein, 

One of these therapeutically valuable molecules is bryodin L The present invention 
provides for modified forms of bryodin 1 with one or more T cell epitopes removed- The 
10 sequence of bryodin 1 protein as given by Gawlak et al [Gawlalc, S. et al (1997) • 
Biochemistry 36:3095-3103] is depicted in single-letter code as follows: 

DVSFRLSGATTTSYGVFIKNLREMiPYERKVYNIPLLRSSISGSGRYTLLHLTNYADETI 
SVAVDVTNVYIMGYLAGDVSYFFNEASATEAAKE^ 
15 IRENIPLGLPALDSAITTLYYYTASSAASALLVLIQSTAESARYKFIEQQIGKRVDKTFL 
PSLATISLENNWSALSKQIQIASTNNGQFESPVVLIDGNNQRVSITNASARWTSNIALL 
LNRNNIAAIGEDISMTLIGFEHGLYGI 

The bryodin 1 protein is single polypeptide of 267 amino acids with a molecular weight 
20 of approximately 29,000 Da, Bryodin 1 is a type 1 ribosome inactivating protein CRIP) 
originally isolated from the roots of the plant Bryonia dionica [US,5541 1 10]. There is 
considerable interest in this and other MPs on account of their toxicity to living cells. In 
particular recombinant forms in fusion with cell-specific targeting domains (e.g. 
antibodies) have potential value in many therapeutic areas where the selective killing of 
25 particular cell populations is a desired outcome. 

It is a particular objective of the present invention to provide modified bryodin 1 proteins 
in which the immune characteristic is modified by means of reduced numbers of potential 
T-cell epitopes. 

30 Others have provided bryodin molecules and in particular recombinant bryodin 1 

[US,5541 110; US,5932447], but these teachings do not recognise the importance of T cell 
epitopes to the immunogenic properties of the protein nor have been conceived to directly 
influence said properties in a specific and controlled way according to the scheme of the 
present invention. 
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By contrast, the PCT patent application WO00/343 17 published 15-June 2000 discloses a 
modified bryodin 1 molecule including substitutions at positions 5 ? 6, 1 8, 27, 111, 164, 
216, 222 s 237 and 249. The substitutions have been selected on the basis of an in silico 
motif matching tool and do not address fee most biologically relevant MHC class n 
5 epitopes detected in a biological assay and which are for the first time disclosed herein. 

There is a continued need for bryodin 1 analogues with enhanced properties. Desired 
enhancements include alternative schemes and modalities for the expression and 
purification of the said therapeutic, but also and especially, improvements in the 
10 biological properties of the protein. There is a particular need for enhancement of the in 
vivo characteristics when administered to the human subject In this regard, it is highly 
desired to provide bryodin 1 with reduced or absent potential to induce an immune 
response in the human subject, 

15 SUMMARY AND DESCRIPTION OF THE INVENTION 

The present invention provides for modified forms of bryodin 1, in which the immune 
characteristic is modified by means of reduced or removed numbers of potential T-cell 
epitopes. 

The invention discloses sequences identified within the bryodin 1 primary sequence that 
20 are potential T-cell epitopes by virtue of MHC class H binding potential. This disclosure 
specifically pertains the bryodin 1 protein being 267 amino acid residues. 
The invention discloses also specific positions within the primary sequence of the 
molecule which according to the invention are to be altered by specific amino acid 
substitution, addition or deletion whilst retaining to a maximum degree the biological 
25 activity of the protein. 

The invention furthermore discloses methods to produce such modified molecules, and 
above all methods to identify said T-cell epitopes which require alteration in order to 
reduce or remove immunogenic sites. 

The present invention provides for modified forma of bryodin 1 proteins that are expected 
30 to display enhanced properties in vivo. The present invention discloses the major regions 
of the bryodin primary sequence that are immunogenic in man and provides modification 
to the said sequences to eliminate or reduce the immunogenic effectiveness of these sites, 
hi one embodiment, synthetic peptides comprising the said immunogenic regions can be 
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provided in pharmaceutical composition for the purpose of promoting a tolerogenic 
response to the whole molecule. In a further embodiment, the modified bryddin 1 
molecules of the present invention can be used in pharmaceutical compositions. 

5 

In summary the invention relates to the following issues: 

• a modified molecule having the biological activity of bry odin 1 and being 
substantially non-immunogenic or less immunogenic than any non-modified molecule 
having the same biological activity when used in vivo; 

10 • an accordingly specified molecule, wherein said loss of iramunogenicity is achieved 
by removing one or more T-cell epitopes derived from the originally non-modified 
molecule; 

• an accordingly specified molecule, wherein said loss of immunogenicily is achieved 
by reduction in numbers of MHC allotypes able to bind peptides derived from said 

15 molecule; 

• an accordingly specified molecule, wherein one T-cell epitope is removed; an 
accordingly specified molecule, wherein said originally present T-cell epitopes are 
MHC class II ligands or peptide sequences which show the ability to stimulate or bind 
T-cells via presentation on class H; 

20 • an accordingly specified molecule, wherein said peptide sequences are selected from 
the group as depicted in FIGURE 1; 

• an accordingly specified molecule, wherein 1-9 amino acid residues, preferably one 
amino acid residue in any of the originally present T-cell epitopes are altered; an 
accordingly specified molecule, wherein the alteration of the amino acid residues is 

25 substitution, addition or deletion of originally present amino acid(s) residue(s) by 
other amino acid residue(s) at specific positions); 

• an accordingly specified molecule, wherein, if necessary, additionally further 
alteration usually by substitution, addition or deletion of specific amino acid(s) is 
conducted to restore biological activity of said molecule; 

30 • a peptide molecule of above sharing greater than 90% amino acid identity with any of 
the peptide sequences of FIGURE 1; 

• a peptide molecule of above sharing greater than 80% amino acid identity with any of 
the peptide sequences of FIGURE 1 ; 
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• peptide sequences as above able to bind MHC class II; 

• an accordingly specified bryodin 1 molecule, wherein one or more of the amino acid 
substitutions is conducted at a position corresponding to any of the amino acids 
specified within FIGURE 1 ; 

5 * an accordingly specified molecule wherein alteration is conducted at one or more" 

residues from any or all of the string of contiguous residues of sequences (a), (b), (c), 
(d), or (e) as below wherein said sequences are derived from the bryodin 1 wild-type 
sequence where using single letter code; 

(a) = ITTLYYYTASS AASALLVLIQSTAESA, 

10 (b) - ATEAAKFVFKD AKKK, 

(c) « ERLQTAAOKIRENIPLGLPALDSA, 

(d) - ITTLYYYTASSAASALLVLIQSTAESA, 

(e) = TFLPSLAHSI^NNWSALSKQIQIAST, 

• a peptide molecule comprising 13-15 consecutive residues from any of sequences (a) 9 
15 (b) 3 (c) s (d) or (e) above; 

• a peptide molecule comprising at least 9 consecutive residues from any of the 
sequences (a), (b), (<0 (d) or (e) above; 

• a peptide molecule of above sharing greater than 90% amino acid identity with any of 
the peptide sequences derived from (a), (b) 9 (d) or (e) above; 

20 • a peptide molecule of above sharing greater than 80% amino acid identity with any of 
the peptide sequences derived from (a), (b), (c) (d) or (e) above; 

• peptide sequences as above able to bind MHC class II; 

• an accordingly specified bryodin 1 molecule., wherein one or more of the amino acid 
substitutions is conducted at a position corresponding to any of the amino acids 

25 specified within any of sequences (a), (b), (c), (d) or (e) above; 

• an accordingly specified bryodin 1 molecule, wherein one or more of the amino acid 
substitutions is conducted at a position corresponding to any of the amino acids 
specified within sequence (b) above; 

i?... & peptide sequence consisting of at least 9 consecutive amino acid residues of any of 
30 the sequences (a), (b), (c), (d) or (e) as specified above and its use for the manufacture 
of bryodinl, a-trichosanthin, a-momorcharin or P-momorcharin having substantially 
no or less immunogenicity than any non-modified molecule and having the biological 
activity of a type 1 RIP when used in vivo; 
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• a pharmaceutical composition comprising any of the peptides or modified peptides of 
above having the activity of binding to MHC class H; 

• a DNA sequence or molecule which codes for any of said specified modified 
molecules as defined above and below; 

s • a pharmaceutical composition comprising a modified molecule having the biological 
activity of bryodin 1; 

• a phaimaceutical composition as defined above and / or in the claims, optionally 
together with a pharmaceutical^ acceptable carrier, diluent or excipient; 

• a melhod for manufacturing a modified molecule having the biological activity of 
10 bryodin 1 as defined in any of the claims of the above-cited claims comprising the 

following steps: © determining the amino acid sequence of the polypeptide or part 
thereof; (ii) identifying one or more potential T-cell epitopes within the amino acid 
sequence of the protein by any method including determination of the binding of the 
peptides to MHC molecules using in vitro or in silica techniques or biological assays; 

15 (iii) designing new sequence variants with one or more amino acids within the 

identified potential T-cell epitopes modified in such a way to substantially reduce or 
eliminate ihe activity of the T-cell epitope as determined by the binding of the peptides 
to MHC molecules using in vitro or in silico techniques or biological assays; (iv) 
constructing such sequence variants by recombinant DNA techniques and testing said 

20 variants in order to identify one or more variants with desirable properties; and (v) 
optionally repeating steps (ii) - (iv); 

• an accordingly specified method, wherein step (iii) is carried out by substitution, 
addition or deletion of 1 - 9 amino acid residues in any of the originally present T-cell 
epitopes; 

25 • an accordingly specified method, wherein the alteration is made with reference to an 
homologous protein sequence and' / or in silico modeling techniques; 

• an accordingly specified method, wherein step (ii) of above is carried out by the 
following steps: (a) selecting a region of the peptide having a known amino acid 
residue sequence; (b) sequentially sampling overlapping amino acid residue segments 

30 of predetermined uniform size and constituted by at least three amin o acid residues 

from the selected region; (c) calculating MHC Class H molecule binding score for each 
said sampled segment by summing assigned values for each hydrophobic amino acid 
residue side chain present in said sampled amino acid residue segment; and (d) 
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15 



20 



identifying at least one of said segments suitable for modification, based on tiie 
calculated MHC Class II molecule binding score for that segment, to change overall 
MHC Class II binding score for the peptide without substantially reducing therapeutic 
utility of the peptide; step (c) is preferably carried out by using a Bohm scoring 
function modified to include 12-6 van der Waal's ligand-protein energy repulsive term 
and ligand conformational energy term by (1) providing a first data base of MHC Class 
II molecule models; (2) providing a second data base of allowed peptide backbones for 
said MHC Class II molecule models; (3) selecting a model from said first data base; 
(4) selecting an allowed peptide backbone from said second database; (5) identifying 
amino acid residue side chains present in each sampled segment; (6) deter mining the 
binding affinity value for all side chains present in each sampled segment; and 
repeating steps (1) through (5) for each said model and each said backbone; 

• a 13mer T-cell epitope peptide having a potential MHC class II binding activity and 
created from non-modified bryodin 1, selected from the group as depicted in FIGURE 
1 and its use for the manufacture of bryodin 1 having substantially no or less 
immunogenicity than any non-modified molecule with the same biological activity 
when used in vivo; 

• a peptide sequence consisting of at least 9 consecutive amino acid residues of a 13mer 
T-cell epitope peptide as specified above and its use for the manufacture of bryodin 
having substantially no or less immunogenicity than any non-modified molecule and 
having the biological activity of bryodin 1 when used in vivo; 

m a 13mer T-cell epitope peptide having a potential MHC class II binding activity and 
created from non-modified bryodin 1, selected from any of the group of sequences 
(FIGURE 1) and its use for the manufacture of bryodin 1 having substantially no or 
less immunogenicity than any non-modified molecule and having the biological 
activity of a bryodin 1 molecule when used in vivo; 

• a peptide sequence consisting of at least 9 consecutive amino acid residues of a 13mer 
T-cell epitope peptide as derived from any of the sequences in FIGURE 1 and its use 
for the manufacture of bryodin 1 having substantially no or less immunogenicity than 
any non-modified molecule and having the biological activity of a bryodin molecule 
when used tn vivo. 
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The term "T-cell epitope" means according to the understanding of this invention an 
amino acid sequence which is able to bind MHC class II, able to stimulate T-cells and / or 
also to bind (without necessarily measurably activating) T-cells in complex with MHC 
class IL 

5 The term "peptide" as used herein and in the appended claims, is a compound that 

includes two or more amino acids. The amino acids axe linked together by a peptide bond 
(defined herein below). There are 20 different naturally occurring amino acids involved 
in the biological production of peptides, and any number of them may be linked in any 
order to form a peptide chain or ring. The naturally occurring amino acids employed in 

10 the biological production of peptides all have the L- configuration. Synthetic peptides can 
be prepared employing conventional synthetic methods, utilizing L-amino acids, D-amiuo 
acids, or various combinations of amino acids of the two different configurations. Some 
peptides contain only a few amino acid units. Short peptides, e.g., having less than ten 
amino acid units, are sometimes referred to as "oligopeptides". Other peptides contain a 

15 large number of amino acid residues, e.g. up to 100 or more, and are xefenred to as 

"polypeptides". By convention, a "polypeptide" may be considered as any peptide chain 
containing three or more amino acids,, whereas a "oligopeptide" is usually considered as a 
particular type of "short" polypeptide. Thus, as used herein, it is understood that any 
reference to a "polypeptide" also includes an oligopeptide. Further, any reference to a 

20 "peptide" includes polypeptides, oligopeptides, and proteins- Each different arrangement 
of amino acids forms different polypeptides or proteins. The number of polypeptides-and 
hence the number of different proteins-that can be formed is practically unlimited. 
"Alpha carbon (Ccc)" is the carbon atom of the carbon-hydrogen (CH) component that is 
in the peptide chain. A "side chain" is a pendant group to Ccc that lean comprise a simple 

25 or complex group or moiety, having physical dimensions that can vary significantly - 
compared to the dimensions of the peptide. 

The invention may be applied to any bryodin 1 species of molecule with substantially the 
same primary amino acid sequences as those disclosed herein and would include therefore 
bryodin 1 molecules derived by genetic engineering means or other processes and may 
30 contain more or less than 267 amino acid residues. 

The invention is conceived to overcome the practical reality that soluble proteins 
introduced with therapeutic intent in man trigger an immune response resulting in 
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development of host antibodies that bind to the soluble protein. The present invention 
seeks to address this by providing biyodin 1 proteins \yitb altered propensity to elicit an 
immune response on administration to the human host. According to the methods 
described herein, the inventors have discovexed the regions of the bryodin 1 molecule 
5 comprising the critical T-cell epitopes driving the immune responses to this protein. 

The general method of the present invention leading to the modified bryodin 1 comprises 
the following steps: 

(a) determining the amino acid sequence of the polypeptide or part thereof; 

10 (b) identifying one or more potential T-cell epitopes within the amino acid sequence of 
the protein by any method including determination of the binding of the peptides to MHC 
molecules using in vitro or in silico techniques or biological assays; 
(c) designing new sequence variants with one or more amino acids within the identified 
potential T-cell epitopes modified in such a way to substantially reduce or eliminate the 

15 activity of the T-oell epitope as determined by the binding of the peptides to MHC 
molecules using in vitro or in silico techniques or biological assays- Such sequence 
variants are created in such a way to avoid creation of new potential T-cell epitopes by 
the sequence variations unless such new potential T-cell epitopes are, in turn, modified in 
such a way to substantially reduce or eliminate the activity of the T-cell epitope; and 

20 (d) constructing such sequence variants by recombinant DNA techniques and testing said 
variants in order to identify one or more variants with desirable properties according to 
well known recombinant techniques. 

The identification of potential T-cell epitopes according to step (b) can be carried out 
25 according to methods describes previously in the prior art. Suitable methods are 

disclosed in WO 98/59244; WO 98/52976; WO 00/343 17 and may preferably be used to 
identify binding propensity of FVEQrderived peptides to an MHC class H molecule. 

In practice a number of variant bryodin proteins will be produced and tested for the 
30 desired immune and functional characteristic. The variant proteins will most preferably 
be produced by recombinant DNA techniques although other procedures including 
chemical synthesis of bryodin 1 fragments may be contemplated. 
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30 



The results of an analysis according to step (b) of tie above scheme and pertaining to the 
bryodin 1 protein sequence is presented in FIGURE 1. 

The invention relates to bryodin analogues in which substitutions of at least one amino 
acid residue have been made at positions resulting in a substantial reduction in activity of 
or elimination of one or more potential T-cell epitopes from the protein. One or more 
amino acid substitutions at particular points within any of the potential MHC class II 
ligands identified in FIGURE 1 may result in a biyodin molecule with a reduced 
immunogenic potential when administered as a therapeutic to the human host 



It is most preferred to provide a bryodin 1 molecule in which amino acid modification 
(e.g. a substitution) is conducted within the most immunogenic regions of the parent 
molecule. The major preferred embodiments of the present invention comprise bryodin 1 
molecules for which any of the MHC class II ligands of FIGURE 1 are altered such as to 
15 eliminate binding or otherwise reduce the numbers of MHC allotypes to which the 
peptide can bind. 

It is most preferred to provide a bryodin 1 molecule in which amino acid modification 
(e.g. a substitution) is conducted within the most immunogenic regions of the parent 
molecule. The inventors herein have discovered that the most immunogenic regions of 

20 the biyodin 1 molecule in man are confined to at least five regions Rl — R5 encompassing 
residues 46-66; 88-102; 112-135; 136-162 andl 78-204 comprising respectively amino 
acid sequences; Rl) RYTLLHLTNYADETISVAVDV; R2) ATEAAKFVFKDAKKK; 
R3) ERLQTAAGKERENIPLGLPALDSA; R4) ITTLYYYTASSAASALLVLIQSTAESA 
and R5) TFLPSLAT1SLENNWSAL5KQIQIAST, The major preferred embodiments of 

25 the present invention comprise biyodin 1 molecules for which the MHC class II ligands 
of FIGURE 1 and which align either in their entirety or to a minimum of 9 amino acid 
residues with any of the above sequence elements (Rl - R5) are altered such as to 
eliminate binding or otherwise reduce the numbers of MHC allotypes to which the 
peptide can bind. 



For the elimination of T-cell epitopes* amino acid substitutions are preferably made at 
appropriate points within the peptide sequence predicted to achieve substantial reduction 
or elimination of the activity of the T-cell epitope. In practice an appropriate point will 
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preferably equate to an amino acid residue binding within one of the pockets provided 
within the MHC class II binding groove. 

It is most preferred to alter binding within the first pocket of the cleft at the so-called P 1 
5 or PI anchor position of the peptide, The quality of binding interaction between the PI 

anchor residue of the peptide and the first pocket of the MHC class II binding groove is 

recognized as being a major determinant of overall binding affinity for the whole peptide. 

An appropriate substitution at this position of the peptide will be for a residue less readily 

accommodated within the pocket* for example, substitution to a more hydrophilic residue. 
10 Amino acid residues in the peptide at positions equating to binding within other pocket 

regions within the MHC binding cleft are also considered and fall under the scope of the 

present 

It is understood that single amino acid substitutions within a given potential T-cell epitope 
15 are the most preferred route by which the epitope may be eliminated. Combinations of 
substitution within a single epitope may be contemplated and for example can be 
particularly appropriate where individually defined epitopes are in overlap with each 
other. Moreover, amino acid substitutions either singly within a given epitope or in 
combination within a single epitope may be made at positions not equating to the "pocket 
20 residues" with respect to the MHC class II binding groove, but at any point within the 
peptide sequence. Substitutions may be made with reference to an homologues structure 
or structural method produced using in silico techniques known in the art and may be 
based on known structural features of the molecule according to this invention. All such 
substitutions fall within the scope of the present invention. 

25 

Amino acid substitutions other than within the peptides identified above may be 
contemplated particularly when made in combination with substitution^) made within a 
listed peptide. For example a change may be contemplated to restore structure or 
biological activity of the variant molecule. Such compensatory changes and changes to 
30 include deletion or addition of particular amino acid residues from the bryodin 1 

polypeptide resulting in a variant with desired activity and in combination with changes in 
any of the disclosed peptides fell under the scope of the present 
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In as far as this invention relates to modified bryodin 1, compositions containing such 
modified biyodin proteins or fragments of modified bryodin proteins and related 
compositions should be considered within the scope of the invention. In another aspect, 
the present invention relates to nucleic acids encoding modified biyodin entities. In a 
5 further aspect the present invention relates to methods for therapeutic treatment of 
humans using the modified bryodin 1 proteins. In this aspect the modified bryodin 1 
protein may be linked with on antibody molecule or fragment of an antibody molecule. 
The linkage may be by means of a chemical cross-linker or the bryodin 1 -antibody may 
be produced as a recombinant fusion protein. The fusion molecule may contain the 
10 modified bryodin 1 domain with antibody domain orientated towards the N-tenninus of 
the fusion molecule although the opposite orientation may be contemplated. 

Desired antibody specificities for linkage to the modified bryodin 1 molecule of the 
present include those directed towards internali si n g antigen determinants. Examples of 

15 this class of antigen are rare but would include the A33 antigen [Heath, J.K. et al (1997) 
Proc. Natl Acad Sci U&A. 94: 469-474] and the GA733-1 antigen [US,5,840,854]. The 
carcinoembryonic antigen may also be contemplated for use and may be targeted by any 
of numerous antibodies but may include MFE23 [Chester, K.A. et al (1994) Lancet 343: 
455], A5B7 [WO92/010159], TS4.66 [US,S,G8 1,235] MN-14 [Hansen, HJ. et al (1993) 

20 Cancer 71: 3478-3485], COL-1 [US,5,472,693] and others. Other desired specificities 
include antibodies directed to non-internalising antigens and this may include antigens 
such as the 4QKDa glycoprotein antigen as recognised by antibody KS 1/4 [Spearman et al 
(1987) J. Pharmacol Exp. Therapeutics 241 : 695-703] and other antibodies. Other 
antigens such as the epidermal growth factor receptor (HERI) or related receptors such as 

25 HER2 may be selected including anti-GD2 antibodies such as antibody 14.1 8 

[US,4,675,287; EP 0 1 92 657], or antibodies to the prostate specific membrane antigen 
[US,6,107,090], the IL-2 receptor [US.6,013,2563, the A33 antigen the Lewis Y 
determinant, mucin glycoproteins or others may be contemplated. 

30 In all instances where a modified bryodin 1 protein is made in fusion with an antibody 
sequence it is most desired to use antibody sequences in which T cell epitopes or 
sequences able to bind MHC class H molecules or stimulate T cells or bind to T cells in 
association with MHC class II molecules have been removed. 
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A further embodiment of the present invention, the modified bryodin 1 protein may be 
linked to a non-antibody protein yet a protein able to direct a specific binding interaction 
to a particular target cell. Such protein moieties include a variety of polypeptide Uganda 
5 for which there are specific cell surface receptors and include therefore numerous 
cytokines, peptide and polypeptide hormones and other biological response modifiers. 
Prominent ©camples include such proteins as vascular epithelial growth factor, epidermal 
growth factor, heregulin, the interleukins, interferons, tumour necrosis factor and other 
protein and glycoprotein molecules. Fusion proteins of these and other molecules with 
10 bryodin 1 of the present invention may be contemplated and may comprise the modified 
bryodin 1 moiety in either the N-terminal or C-terminal orientation with respect to the 
protein ligand domain. Equally, chemical cross-linking of the purified ligand to the 
modified bryodin 1 protein may be contemplated and within the scope of the present 
invention. 

15 

In a further embodiment the modified bryodin 1 protein of the present may be used as a 
complex containing a water soluble polymer such as hydroxypropylmethacrylamide or 
Other polymers where the modified bryodin 1 protein is in covalent attachment to the 
polymer or in a non-covalent binding interaction with the polymer. Such an embodiment 
20 may additionally include an antigen binding domain such as an antibody or a fragment of 
an antibody in combination with the polymer bryodin 1 complex. 

In a further aspect still, the invention relates to methods for therapeutic treatment using 
pharmaceutical preparations comprising peptide or derivative molecules with sequence 
25 identity or part identity with the sequences herein disclosed. 

In a yet further aspect, the major immunogenic epitopes herein disclosed and relating to 
the biyodin 1 molecule are also shown herein to be present within the primary sequence 
of a number of other type 1 RIP proteins of which bryodin 1 is an example. Thus the 
30 proteins a-trichosanthin (1TCS), a-momorcharin (1MOM) and p-momorcharin (1CF5) 
and others may be shown by protein sequence analysis to contain sequence elements with 
identity or near identity to the immunogenic regions of the bryodin 1 molecule. FIGURE 
3 depicts sequence comparisons between biyodin 1 major epitopes and sequence elements 
from 1TCS, 1MOM and 1 CF5 proteins. The present invention in so far as it relates to 
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peptides and modified sequences derived from the bryodin 1 protein, where the identical 
or substantially similar sequences are identified within other proteins, these are 
considered equally to fall under the scope of the present 

5 The invention will now he illustrated by the following figures: 

FIGURE 1 provides a list of peptide sequences in bryodin 1 with potential 
human MHC class II binding activty Peptides are 13-mexs, amino acids are 
identified using single letter code 

FIGURE 2 provides a table of the bryodin 1 15-mer peptide sequences analysed 
using the naJve human in vitro T-cell assay of EXAMPLE 2. The peptide ID# and 
position of the N-tenninal peptide residue within the bryodin 1 sequence is - 
indicated 

FIGURE 3 indicates the sequence elements Rl, R2 ? R3, R4 and R5 from the 
bryodin 1 (1BRY) sequence which give a stimulation index of 2.0 or greater in in 
the naiVe human in vitro T-cell assay of EXAMPLE 2. Corresponding sequences 
from related proteins a-trichosauthin (1TCS), a-momoicharin (1MOM) and 0- 
momorcharin (1CF5) axe shown beneath each bryodin 1 sequence. Sequences are 
identical to bryodin 1 except where indicated. Amino acids are depicted using 
single letter code. 



EXAMPLE 1 

There are a number of factors that play important roles in determining the total structure 
of a protein ox polypeptide. First, the peptide bond, Le„ that bond which joins the amino 
acids in the chain together, is a covalent bond. This bond is planar in structure, 
30 essentially a substituted amide. An "amide" is any of a' group of organic compounds 
containing fee grouping -CONH-. 

The planar peptide bond linking Ca of adjacent ami-no acids may be represented as 
depicted below: 



15 
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Bec aus e the 0=C and the ON atoms lie in a relatively rigid plane, free rotation does not 
occur about these axes. Hence, a plane schematically depicted by the interrupted line is 
sometimes refeired to as an "amide" or "peptide plane" plane wherein lie the oxygen (O), 
carbon (C), nitrogen (N), and hydrogen (H) atoms of the peptide backbone. At opposite 

5 corners of this amide plane are located the Ccc atoms. Since there is substantially no 
rotation about the 0=C and ON atoms in the peptide or amide plane, a polypeptide chain 
thus comprises a series of planar peptide linkages joining the Ccc atoms, 
A second factor that plays an important role in defining the total structure or 
conformation of a polypeptide or protein is the angle of rotation of each amide plane 

10 about the common Ca linkage. The terms "angle of rotation" and "torsion angle" are 
hereinafter regarded as equivalent terms. Assuming that the O, C, N, and H atoms remain 
in the amide plane (which is usually a valid assumption, although there may be some- 
slight deviations from planarity of these atoms for some conformations), these angles of 
rotation define the N and R polypeptide's backbone conformation, i.e., the structure as it 

15 exists between adjacent residues. These two angles are known as <J> and A set of the 
angles <j>i, where the subscript i represents a particular residue of a polypeptide chain, 
thus effectively defines the polypeptide secondary structure. The conventions used in 
defining the <J>, \|/ angles, i.e., the reference points at which the amide planes form a zero 
degree angle, and the definition of which angle is <k and which angle is for a given 

20 polypeptide, axe defined in the literature. See, e,g„ Ramachandran et aL Adv. Prot Ckern. 
23:283-437 (1968), at pages 285-94, which pages are incorporated herein by reference. 

The present method can be applied to any protein, and is based in part upon the discovery 
that in h umans the primary Pocket 1 anchor position of MHC Class II molecule binding 
25 grooves has a well designed specificity for particular amino acid side chains. The 

specificity of this pocket is determined by the identity of the amino acid at position 86 of 
the beta chain of the MHC Class II molecule. This site is located at the bottom of Pocket 
1 and determines the size of the side chain that can be accommodated by this pocket 

Marshall, K.W., JC Immunol., 152:4946-4956 (1994). If this residue is a glycine, then all 
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hydrophobic aliphatic and aromatic amino acids (hydrophobic aliphatics being: valine, 
leucine, isoleucine, mettnonineand aromatics being: phenylalanine, tyrosine and 
tryptophan) can be accommodated in the pocket, a preference being for the aromatic side 
chains. If this pocket residue is a valine, then the side chain of this amino acid protrudes 

5 into the pocket and restricts the size of peptide side chains that can be accommodated 
such that only hydrophobic aliphatic side chains can be accommodated. Therefore, in an 
amino acid residue sequence, wherever an amino acid with ahydrophobic aliphatic or 
aromatic side chain is found, there is the potential for a MHC Class H restricted T-cell 
epitope to be present. If the side-chain is hydrophobic aliphatic, however, it is 

10 approximately twice as likely to be associated with a T-cell epitope than an aromatic side 
chain (assuming an approximately even distribution of Pocket 1 types throughout the 
global population). 

A computational method embodying the present invention profiles the likelihood of 
peptide regions to contain T-cell epitopes as follows: 
15 (1) The primary sequence of apeptide segment of predetermined length is scanned, and 
all hydrophobic aliphatic and aromatic side chains present are identified. (2)The 
hydrophobic aliphatic side chains ate assigned a value greater than that for the aromatic 
side chains; preferably about twice the value assigned to the aromatic side chains, e.g. s a 
value of 2 for a hydrophobic aliphatic side chain and a value of 1 for an aromatic side 
20 chain. (3) The values determined to be present are summed for each overlapping amino 
acid residue segment (window) of predetermined uniform length within the peptide, and 
the total value for a particular segment (window) is assigned to a single amino acid 
residue at an intermediate position of the segment (window), preferably to a residue at 
about the midpoint of the sampled segment (window). This procedure is repeated for 
25 each sampled overlapping amino acid residue segment (window). Thus, each amino acid 
residue of the peptide is assigned a value that relates to the likelihood of a T-cell epitope 
being present in that particular segment (window). (4) The values calculated and assigned 
as described in Step 3, above, can be plotted against the amino acid coordinates of the 
enure amino acid residue sequence being assessed. (5) All portions of the sequence which 
have a score of a predetermined value, e.g., a value of 1 , are deemed likely to contain a T- 
cell epitope and can be modified, if desired. 

This particular aspect of the present invention provides a general method by which the 
• regions of peptides likely to contain T-cell epitopes can be described. Modifications to the 
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peptide in these regions have the potential to modify the MHC Class II binding 
characteristics . 

According to another aspect of the present invention, T-cell epitopes can be predicted 
with greater accuracy by the use of a more sophisticated computational method which 
5 takes into account the interactions of peptides with models of MHC Class H alleles. 

The computational prediction of T-cell epitopes present within apeptide according to this 
- particular aspect contemplates the construction of models of at least 42 MHC Class H 
alleles based upon the structures of all known MHC Class H molecules and a method for 
the use of these models in the computational identification of T-cell epitopes, the 
10 construction of libraries of peptide backbones for each model in order to allow for the 
known variability in relative peptide backbone alpha carton (Cot) positions, the 
construction of libraries of amino-acid side chain conformations for each backbone dock 
with each model for each of the 20 amino-acid alternatives at positions critical for the 
interaction between peptide and MHC Class H molecule, and the use of these libraries of 
15 backbones and side-chain conformations in conjunction with a scoring function to select 
the optimum backbone and side-chain conformation for a particular peptide docked with a 
particular MHC Class H molecule and the derivation of a binding score from this 
iotexactioiL. 

Models of MHC Class II molecules can be derived via homology modeling from a 
20 number of similar structures found in the Brookhaven Protein Data Bank C'FDB"). These 
may be made by the use of semi-automatic homology modeling software (Modeller, SaU 
A. & Blundell TL-, 1993. J. Mol Biol23±J19-Zl5) which incorporates a simulated 
annealing function, in conjunction with the CHARMm force-field for energy 
nunimisation (available from Molecular Simulations Inc., San Diego, Ca_). Alternative 
25 modeling methods can be utilized as well. 

The present method differs significantly from other computational methods which use 
libraries of experimentally derived binding data of each amino-acid alternative at each 
position in the binding groove for a small set of MHC Class H molecules (Marshall, 
K.W., etaUBiomed. Pept Proteins Nucleic Acids, 1(3):157-162) (1995) or yet other 
30 computational methods which use similar experimental binding data in order to define me 
binding characteristics of particular types of binding pockets within the groove, again 
using a relatively small subset of MHC Class H molecules, and then 'mixing and 
matching' pocket types from this pocket library to artificially create further 'virtual' . 
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MHC Class H molecules (Stutniolo T v et al, iter. Biotech, 555-561 (1999). Both 

prior methods suflfer tiie major disadvantage that, due to the complexity of the assays and 
the need to synthesize large numbers of peptide variants, only a small number of MHC 
Class H molecules can be experimentally scanned. Therefore the first prior method can 
5 only make predictions for a small number of MHC Class U molecules. The second prior 
method also makes the assumption that apocket lined with similar amino-acids in one 
molecule mil have the same binding characteristics when in the context of a different 
Class E allele and suffers further disadvantages in that only those MHC Class II 
molecules can be 'virtually' created which contain pockets contained within the pocket 

10 library, Using the modeling approach described herein, the structure of any number and 
type of MHC Class II molecules can be deduced, therefore alleles can be specifically 
selected to be representative of the global population. In addition, the number of MHC 
Class II molecules scanned can be increased by making further models further than 
having to generate additional data via complex experimentation. 

15 The use of a backbone library allows for variation in the positions of the Ccc atoms of the 
various peptides being scanned when docked with particular MHC Class II molecules. 
This is again in contrast to the alternative prior computational methods described above 
which rely on the use of simplified peptide backbones for scanning amino-acid binding in 
particular pockets. These simplified backbones are not likely to be representative of 

20 backbone conformations found in c reaT peptides leading to inaccuracies in prediction of 
peptide binding. The present backbone library is created by superposing the backbones of 
all peptides bound to MHC Class II molecules found within the Protein Data Bank and 
noting the root mean square (RMS) deviation between the Ca atoms of each of the eleven 
amino-acids located within the binding groove. While this library can be derived from a 

25 small number of suitable available mouse and human structures (currently 13), in order to 
allow for the possibility of even greater variability, the RMS figure for each C"-a position 
is increased by 50%. The average Ca position of each amino-acid is then determined and 
a sphere drawn around this point whose radius equals the RMS deviation at that position 
plus 50%. This sphere represents all allowed Ca positions. 

30 Working from the Ca with the least RMS deviation (that of the amino-acid in Pocket 1 as 
mentioned above, equivalent to Position 2 of the 1 1 residues in the binding groove), the 
sphere is three-dimentf onally gridded, and each vertex within the grid is then used as a 
possible location for a Ca of that amino-acid. The subsequent amide plane, corresponding 
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to the peptide bond to the subsequent amino-acid is grafted onto each of these Cas and 
the <j> and \\f Angles are rotated step-wise at set intervals in order to position the subsequent 
Ca. If the subsequent Ca falls within the 'sphere of allowed positions' for tins Ca than 
the orientation of the dipeptide is accepted, whereas if it falls outside the sphere then the 

5 dipeptide is rejected 

This process is then repeated for each of the subsequent Ca positions, such that the 
peptide grows from the Pocket 1 Ca E seed\ until all nine subsequent Cas have been 
positioned from all possible permutations of the preceding Cas. The process is then " 
repeated once more for the single Ca preceding pocket 1 to create a library of backbone 

1 0 Ca positions located within the binding groove. 

The number of backbones generated is dependent upon several factors: The size of the 
'spheres of allowed positions'; the fineness of the gridding of the 'primary sphere' at the 
Pocket 1 position; the fineness of the step-wise rotation of the <j> and \|/ angles used to 
position subsequent Cos. Using this process, a large library of backbones can be created. 

15 The larger the backbone library, the more likely it will be that the optimum fit will be 
found for a particular peptide within the binding groove of an MHC Class II molecule. 
Inasmuch as all backbones will not be suitable for docking with all the models of MHC 
Class II molecules due to clashes with amino^cids of the binding domains, for each allele 
a subset of the library is created comprising backbones which can be accommodated by 

20 that allele. 

The use of the backbone library, in conjunction with the models of MHC Class II 
molecules creates an exhaustive database consisting of allowed side chain conformations 
for each amino-acid in each position of the binding groove for each MHC Class II 
molecule docked with each allowed backbone. This data set is generated using a simple 

25 steric overlap function where a MHC Class H molecule is docked with a backbone and an 
amino-acid side chain is grafted onto the backbone at the desired position- Each of the 
rotatable bonds of the side chain is rotated step-wise at set intervals and the resultant 
positions of the atoms dependent upon that bond noted. The interaction of the atom with 
atoms of side-chains of the binding groove is noted and positions are either accepted or 

30 rejected according to the following criteria: The sum total of the overlap of all atoms so 
far positioned must not exceed a pre-determined value. Thus the stringency of the 
conformational search is a function of the interval used in the step-wise rotation of the 
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bond and the pre-detennined limit for the total overlap. This latter value can be small* if it 
is known that a particular pocket is rigid, however the stringency can be relaxed if the 
positions of pocket side-chains are knovvn to be relatively flexible. Thus allowances can 
be made to imitate variations in flexibility within pockets of the binding groove. This 
5 Conformational search is then repeated for every amino-acid at every position of each 
backbone when docked with each of the MHC Class II molecules to create the exhaustive 
database of side-chain conformations. 

A suitable mathematical expression is used to estimate the energy of binding between 
models of MHC Class II molecules in conjunction with peptide ligand conformations 

] 0 which have to be empirically derived by scanning the large database of backbone/side- 
chain conformations described above. Thus a protein is scanned for potential T-cell 
epitopes by subjecting each possible peptide of length varying between 9 and 20 amino- 
acids (although the length is kept constant for each scan) to the following computations: 
An MHC Class II molecule is selected together with a peptide backbone allowed for that 

15 molecule and the side-chains corresponding to the desired peptide sequence are grafted 
on. Atom identity and interatomic distance data relating to a particular side-chain at a 
particular position on the backbone are collected for each allowed conformation of that 
amino-acid (obtained from the database described above). This is repeated for each side- 
chain along the backbone and peptide scores derived using a scoring function. The best 

20 score for that backbone is retained and the process repeated for each allowed backbone 
for the selected model. The scores from all allowed backbones are compared and the 
highest, score is deemed to be the peptide score for the desired peptide in that MHC Class 
II model. This process is then repeated for each model with every possible peptide 
derived from the protein being scanned, and the scores for peptides versus models are 

25 displayed. 

In the context of the present invention, each ligand presented for the binding affinity 
calculation is an amino-acid segment selected from a peptide or protein as discussed 
above. Thus, the ligand is a selected stretch of amino acids about 9 to 20 amino acids in 
length derived from a peptide, polypeptide or protein of known sequence. The terms 
30 "amino acids" and "residues" are hereinafter regarded as equivalent terms. 

The ligand, in the form of the consecutive amino acids of the peptide to be examined 
grafted onto a backbone from the backbone library, is positioned in the binding cleft of an 
MHC Class H molecule from the MHC Class n molecule model library via the 
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coordinates of the C"-ct atoms of the peptide backbone and an. allowed conformation for 
each side-chain is selected from the database of allowed conformations- The relevant 
atom identities and interatomic distances are also retrieved from this database and used to 



5 Class II binding pocket are flagged as candidates for site-directed mutagenesis- Amino- 
acid substitutions are made in the flagged ligand (and hence in the protein of interest) 
which is then retested using the scoring function in order to determine changes which 
reduce the binding affinity below a predetermined threshold value. These changes can 
then be incorporated into the protein of interest to remove T-cell epitopes. 

10 Binding between the peptide ligand and the binding groove of MHC Class II molecules 
involves non-covalent interactions including, but not limited to: hydrogen bonds, 
electrostatic interactions, hydrophobic (lipophilic) interactions and Van der Walls 
interactions. These are included in the peptide scoring function as described in detail 
below. 

15 It should be understood that a hydrogen bond is a non-covalent bond which can be formed 
between polar or charged groups and consists of a hydrogen atom shared by two other 
atoms. The hydrogen of the hydrogen donor has a positive charge where the hydrogen 
acceptor has a partial negative charge. For the purposes of peptide/protein interactions, 
hydrogen bond donors may be either nitrogens with hydrogen attached or hydrogens 

20 attached to oxygen or nitrogen. Hydrogen bond acceptor atoms may be oxygens not 
attached to hydrogen* nitrogens with no hydrogens attached and one or two connections, 
or sulphurs with only one connection. Certain atoms, such as oxygens attached to 
hydrogens or ^tnir^ nitrogens (e.g. 0=*NH) may be both hydrogen acceptors or donors. 
Hydrogen bond energies range from 3 to 7 Kcal/mol and are much stronger than Van der 

25 Waal's bonds, but weaker than covalent bonds. Hydrogen bonds are also highly 

directional and are at their strongest when the donor atom, hydrogen atom and acceptor 
atom are co-linear- 

Electrostatic bonds are formed between oppositely charged ion pairs and the strength of 
the interaction is inversely proportional to the square of the distance between the atoms 
30 according to Coulomb's law. The optimal distance between ion pairs is about 2-8A. In 
protein/peptide interactions, electrostatic bonds may be formed between arginine, 
histidine or lysine and aspartate or glutamate. The strength of the bond will depend upon 



calculate the peptide binding score, Ligands with a high binding affinity for tie MHC 
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the pKa of the ionizing group and the dielectric constant of the medium although they are 
approximately similar in strength to hydrogen bonds. 

Lipophilic interactions are favorable hydrophobic-hydrophobic contacts mat occur 
between he protein and peptide ligand. Usually, these will occur between hydrophobic 
5 amino acid side chains of the peptide buried within the pockets of the binding groove 
such that they are not exposed to solvent Exposure of the hydrophobic residues to solvent 
is highly unfevorable since the surrounding solvent molecules are forced to hydrogen 
bond with each other forming cage-like clathrate structures. The resultant decrease in 
entropy is highly unfevorable. Lipophilic atoms may be sulphurs which are neither polar 

10 nor hydrogen acceptors and carbon atoms which are not polar. 

Van der Waal's bonds are non-specific forces found between atoms which are 3 -4 A apart. 
They are weaker and less specific than hydrogen and electrostatic bonds. The distribution 
of electronic charge around an atom changes with time and, at any instant, the charge 
distribution is not symmetric This transient asymmetry in electronic charge induces a 

15 similar asymmetry in neighboring atoms. The resultant attractive forces between atoms 
reaches a maximum at the Van der Waal's contact distance but diminishes very rapidly at 
about 1 A to about 2A. Conversely, as atoms become separated by less than the contact 
distance, increasingly strong repulsive forces become dominant as the outer electron - 
clouds of the atoms overlap. Although the attractive forces are relatively weak compared 

20 to electrostatic and hydrogen bonds (about 0. 6 Kcal/mol), the repulsive forces in 
particular may be very important in determining whether a peptide ligand may bind 
successfully to a protein. 

In one embodiment, the BShm scoring function (SCOEE1 approach) is used to estimate 
the binding constant (Bohm, H. J., J. Comput Aided Mol Des., 8(3):243-256 (1994) 

25 which is hereby incorporated in its entirety), hx another embodiment, the scoring function 
(SCORE2 approach) is used to estimate the binding affinities as an indicator of a ligand 
containing a T-cell epitope (Bohm, HJ.,J. Comput Aided Mot Des., 12(4):309-323 
(1998) which is hereby incorporated in its entirety). However, the Bohm scoring 
functions as described in the above references are used to estimate the bmding affinity of 

30 a ligand to aprotein where it is already known that the ligand successfully binds to the 
protein and the protein/ligand complex has had its structure solved, the solved structure 
being present in the Protein Data Bank ("PDB"). Therefore, the scoring function has 
been developed with the benefit of knownpositive binding data. In order to allow for 
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disoimioatioii between positive and negative binders, a repulsion term must be added to 
the equation. In addition, a more satisfectory estimate of binding energy is achieved by 
computing the lipophilic interactions in a pairwise manner raSher than using the area 
based energy term of the above BOtun functions. 

5 Therefore* in a preferred embodiment the binding energy is estimated using a modified 
B5hm scoring function. In the modified Bohm scoring function, the binding energy 
between protein and ligand (AG^ind) is estimated considering the following parameters: 
The reduction of binding energy due to the overall loss of translations^ and rotational 
entropy of the ligand (AGo); contributions from ideal hydrogen bonds (AGw,) where at 

10 least one partner is neutral; contributions from unperturbed ionic interactions (AGi Cn ic3; 
lipophilic interactions between lipophilic ligand atoms and lipophilic acceptor atoms 
(AG«po); the loss of binding energy due to the freezing of internal degrees of freedom in 
the ligand, i.e., the freedom of rotation about each C-C bond is reduced (AG^; the 
energy of the interaction between the protein and ligand (Evaw)- Consideration of these 

15 terms gives equation 1 : 

(AGbind) 2 ^ AG 0 ) + ( AGhbXNhb) + ( AGionicXNionie) + ( AGi ipo xNiipc) + ( 
AG ro t+Nrot) + (E vdw) - 

Where N is the number of qualifying interactions for a specific term and, in one 
embodiment, AGo, AGm* AGionfc, AGi it » 0 and AG** are constants which are given the 
20 values: 5,4, -4.7, -4.7, -0.17, and 1.4, respectively. 
The term Nm> is calculated according to equation 2: 

fpas 

fi(AR, Aa) is a penalty function which accounts for large deviations of hydrogen bonds 
from ideality and is calculated according to equation 3 : 
25 f(AR, A-a) = f 1 (AR) x £2 (Aa) 

Where: f 1 (AR) = 1 if AR <= TOL 

or =1 - (AR - TOD/0.4 if AR <= 0.4 + TOL 
or =0 if AR >0.4 + TOL 
And: f2(Aa) = 1 if Aa <30° 

30 or =l-( Aa - 30) /50 if Aa <=80° 

or =0 if Aa >80° 
TOL is the tolerated deviation in hydrogen bond length = 025h 
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is the deviation of the H-O/N hydrogen bond length from the ideal value — 1 .9 A 
Atx is the deviatioii of the hydrogen bond angle Z n/o-h_o/n from its idealized vjalue of 



fCNneight) distinguishes between concave and convex parts of a protein surface and 
5 therefore assigns greater weight to polar interactions found in pockets rather than those 
found at the protein surface. This function is calculated according to equation 4 below: 
f (N ne i ghb ) = (Nneighb/Nneighb^o) * where a = 0.5 

Nneigbb is the number of non-hydrogen protein atoms that are closer than 5A to any given 

protein atom. 
10 Nn«gbb,o is a constant = 25 

fpcs is a function which allows for the polar contact surface area per hydrogen bond and 

therefore distinguishes between strong and weak hydrogen bonds and its value is 

determined according to the following criteria: 
fi whenApoiar/NHB < 10 A 2 
15 or f P cs= 1 when Ap 0 iar/NHB > 10 A 2 

Apoiar is the size of the polar protein-ligand contact surface 

Nhb is the number of hydrogen bonds 

B is a constant whose value = 1-2 

For the implementation of the modified Bohm scoring function, the contributions from 
20 ionic interactions, A& 10nic? are computed in a similar fashion to those from hydrogen 
bonds described above since the same geometry dependency is assumed. 
The term Nk p * is calculated according to equation 5 below: 

f(rn,) is calculated for all lipophilic ligand atoms, 1, and all lipophilic protein atoms, L, 
25 according to the following criteria: 

f (r 1L ) =1 when <= Rlf(n L ) =(riL - R1)/(R2-R1) when R2 <r iL 
> Rl 

f {^xu) =0 when r iL >= R2 
Where: Rl ~ + r L vt4w +0.5 

30 and R2 « Rl + 3.0 

and xi"** is the Van der "Waal's radius of atom 1 
and rj** is the Van der Waal's radius of atom L 
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The termNrot is the number of rotable bonds of the amino acid side chain and is taken to 
be the number of acyclic sp 3 - sp 3 and sp 3 - sp 2 bonds. Rotations of terminal -CEfe or - 
NH3 are not taken into account 

The final term, Evctw, is calculated according to equation 6 below: 
5 Evd* - ^((n^ +r 2 vd ") 12 /r 12 - (n ir * f +r 2 vdw ) Vr 6 ) , where: 
Si and £2 are constants dependant upon atom identity 
t{* w W*™ are the Van der Waal's atomic radii 
r is the distance between a pair of atoms. 

With regard to Equation 6, in one embodiment, the constants si and Z2 are given the atom 

10 values; C: 0.245, N: 0.283, 0: 0316, S: 0.316, respectively (Le. for atoms of Carbon, 

Nitrogen, Oxygen and Sulphur, respectively). With regards to equations 5 and 6, the Van 
der Waal's radii are given the atom values C: 1.85, N: 1.75, 0: 1.60 ? S: 2.00A. 
It should be understood that all predetermined values and constants given in the equations 
above are determined within the constraints of current understandings of protein ligand 

15 interactions with particular regard to the type of computation being undertaken herein- 
Therefore, it is possible that, as this scoring function is refined further, these values and 
constants may change hence any suitable numerical value which gives the desired results 
in terms of estimating the binding energy of a protein to a ligand may be used and hence 
fell within the scope of the present invention. 

20 As described above, the scoring function is applied to data extracted from the database of 
side-chain conformations, atom identities, and interatomic distances. For the purposes of 
the present description, the number of MHC Class II molecules included in this database 
is 42 models plus four solved structures. It should be apparent from the above 
descriptions that the modular nature of the construction of the computational method of 

25 the present invention means that new models can simply be added and scanned with the 
peptide backbone library and side-chain confonnational search function to create 
additional data sets which can be processed by the peptide scoring function as described 
above. This allows for the repertoire of scanned MHC Class II molecules to easily be 
" increased, or structures and associated data to be replaced if data are available to create 

30 more accurate models of the existing alleles. 

The present prediction method can be calibrated against a data set comprising a large 
number of peptides whose affinity for various MHC Class II molecules has previously 
been experimentally determined. By comparison of calculated versus experimental data, 
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a cat of value can be determined above which it is known that all experimentally 
determined T-cell epitopes are correctly predicted. 

It should be understood that, although the above scoring function is relatively simple 
compared to some sophisticated methodologies that are available, the calculations are 
5 performed extremely rapidly. It should also be understood that the objective is not to 
calculate the true binding energy .per se for each peptide docked in the binding groove of 
a selected MHC Class H protein. The underlying objective is to obtain comparative 
binding energy data as an aid to predicting the location of T-cell epitopes based on the 
primary structure (i,e. amino acid sequence) of a selected protein. A relatively high 
io binding energy or a binding energy above a selected threshold value would suggest the 
presence of a T-cell epitope in the ligand. The ligand may then be subjected to at least 
one round' of amino-acid substitution and the binding energy recalculated. Due to the 
rapid nature of the calculations, these manipulations of the peptide sequence can be 
performed interactively within the program's user interlace on cost-effectively available 
15 computer hardware. Major investment in computer hardware is thus not required. 

It would be apparent to one skilled in the art that other available software could be used 
for the same purposes, In particular, more sophisticated software which is capable of 
docking Uganda into protein binding-sites may be used in conjunction with energy 
minimization. Examples of docking software are: DOCK (Kuntz et aL, J. Mol. Biol., 
20 161:269-288 (1982)), LUDI (BQhm, H.J., J Comput Aided Mol Des., 8:623-632 (1994)) 
and FLEXX (Rarey M., et aL, IS MB, 2:300-308 (1995)). Examples of molecular 
modeling and manipulation software include: AMBER (Tripos) and CHARMm 
(Molecular Simulations Inc.). The use of these computational methods would severely 
limit the throughput of the method of this invention due to the lengths of processing time 
25 required to make the necessary calculations. However, it is feasible that such methods 
could be used as a 'secondary screen' to obtain more accurate calculations of bmding 
energy for peptides which are found to be 'positive binders' via the method of the present 
invention. 

The limitation of processing time for sophisticated molecular mechanic or molecular 
30 dynamic calculations is one which is defined both by the design of the software which 
makes these calculations and the current technology limitations of computer hardware. It 
may be anticipated that in the future, with the writing of more efficient code and the 
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continuing increases in speed of computer processors, it may become feasible to make 
such calculations -within a more manageable time-frame. 

Further information on energy functions applied to macromolecules and consideration of 
the various interactions that take place within a folded protein structure can be found in: 

5 Brooks, B.R., et aL, J. Comput Chem., 4:187-217 (1983) and further information 

concerning general protein-ligand interactions can be found in; Danber-Osguthorpe et aL, 
FroteinsMX)S 1-47(1 988), which are incorporated herein by reference in their entirely. 
Useful background information can also be found, for example, in Fasman, G.D., ed., 
Prediction of Protein Structure and the Principles of Protein Corformation, Plenum 

10 Press, New York, ISBN: 0-306 4313-9. 



EXAMPLE 2 

The interaction between MHC, peptide and T-cell receptor (TCR) provides the structural 
basis for the antigen specificity of T-cell recognition- T-cell proliferation assays test the 

15 binding of peptides to MHC and the recognition of MHC/peptide complexes by the TCR. 
In vitro T-cell proliferation assays of the present example, involve the stimulation of 
peripheral blood mononuclear cells (PBMCs), containing antigen presenting cells (APCs) 
and T-cells. Stimulation is conducted in vitro using synthetic peptide antigens, and in 
some experiments whole protein antigen- Stimulated T-cell proliferation is measured 

20 using 3 H-thymidine ( 3 H-Thy) and the presence of incorporated 3 H-Thy assessed using 
scintillation counting of washed fixed cells. 



Buffy coats from human blood stored for less than 12 hours were obtained from the 
National Blood Service (Addenbrooks Hospital, Cambridge, UK). Ficoll-paque was 
25 obtained from Amersham Pharmacia Biotech (Amersbaco, UK). Serum free AIM V 
media for the culture of primary human lymphocytes and containing L-glutamine, 
SOfigfml streptomycin, lOjxg/ml gentomycin and 0.1% human serum albumin was from 
Gibco-BRL (Paisley, UK). Synthetic peptides were obtained from Eurosequence 
(Groningen, The Netherlands) and Babraham Technix (Cambridge, UK). 

30 

Erythrocytes and leukocytes were separated from plasma and platelets by gentle 
centrifugation of buffy coats. The top phase (containing plasma and platelets) was 
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removed and discarded. Erythrocytes and leukocytes were diluted 1 ;1 in phosphate 
buffered saline (PBS) before layering onto 15ml ficoll-paque (Amersham Pharmacia, 
Amershain UK). Centrifugation 'was done according to the manufacturers recommended • 
conditions and PBMCs were harvested fiom the serum+PBS/ficolI paque interface. 
5 PBMCs were mixed -with PBS (1 :1) and collected by centrifugation. The supernatant was 
removed and discarded and the PBMC pellet resuspeaded in 50ml PBS. Cells were again 
pelleted by centrifugation and the PBS supernatant discarded. Cells were resuspeuded 
using 50ml AIM V media and at this point counted and viability assessed using trypan 
blue dye exclusion. Cells were again collected by centrifugation and the supernatant 

10 discarded. Cells were resuspeaded for cryogenic storage at a density of 3xl0 7 per ml. 
The storage medium was 90%(v/v) heat inactivated AB human serum (Sigma, Poole, UK) 
and 10%(v/v) DMSO (Sigma, Poole, UK). Cells were transferred to a regulated freezing 
container (Sigma) and placed at — 70°C overnight When required for use, cells were 
thawed rapidly in a water bath at 37°C before transferring to 10ml pre- wanned AIM V 

15 medium, 

PBMC were stimulated with protein and peptide antigens in a 96 well flat bottom plate at 
a density of 2xl0 5 PBMC per well. PBMC were incubated for 7 days at 37°C before 
pulsing with 3 H-Thy (Amersham-Phamacia, Amersham, UK). For the present study, 

20 synthetic peptides (1 Smers) that overlapped by 12 amino acids were generated that 
spanned the entire sequence of hryodin 1. Peptide identification numbers (TD#) and 
sequences are given in FIGURE 2. Each peptide was screened individually against 
PBMCs isolated from 21 naive donors. Two control peptides that have previously been 
shown to be immunogenic and a potent non-recall antigen KXH were used in each donor 

25 assay. 

The control antigens used in this study were as below: 



Peptide 


Sequence 


C-32 


Biotin-PKYVKQNTLKLAT ; 
Flu haemagglutinin 3 07-3 1 9 


C-49 


KWDQIKKISKPVQH 
Chlamydia HSP 60 peptide 


KXH 


Whole protein from Keyhole 
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| | Limpet Hemocyanin. [ 

Peptides were dissolved in DMSO to a final concentration of IQmM, these stock solutions 
were then diluted 1/500 in AIM V media (final concentration 20f*M), Peptides were 
added to a flat bottom 96 well plate to give a final concentration of 2 and 20^M in a 

5 lOOpl. The viability of thawed PBMC's was assessed by trypan blue dye exclusion, cells 
were thenresuspended at a density of 2x10 s cells/ml, and IGQjil (2xl0 5 PBMC/well) was 
transferred to each well containing peptides. Triplicate well cultures were assayed at each 
peptide concentration,. Plates were incubated for 7 days in a humidified atmosphere of 
5% COi at 37°C. Cells were pulsed for 18-21 hours with lfiCi 3 H-Thy/well before 

10 harvesting onto filter mats. CPM values were determined using a Wallac microplate beta 
top plate counter (Peddn Elmer). Results were expressed as stimulation indices, 
determined using the following formula; 

Proliferation to test peptide CPM 
Proliferation in untreated wells CPM 

15 

Mapping T cell epitopes in the bryodin 1 sequeace using the T cell proliferation assay 
resulted in the identification of five major immunogenic regions encompassed by peptide 
ID#16-18, 30, 38-41, 46-50 and 60-64. For each of these peptides, responsive donors 
showed a stimulation index >2.0- 



20 
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Patent Claims 

1 . A modified molecule having the biological activity of hryodin 1 and being 
substantially non-immunogenic or less immunogenic than any non-modified 

5 molecule having the same biological activity when used in vivo. 

2. A molecule of olaiml, -wherein said loss of inummogenicity is achieved by 
removing one or more T-cell epitopes derived from the originally non-modified 
molecule. 

10 

3. A molecule of claim 2, wherein said loss of immunogenicity is achieved by 
reduction in numbers of MHC allotypes able to bind peptides derived from said 
molecule. 

15 4. A molecule of claim 2 or 3, wherein one T-cell epitope is removed; an accordingly 
specified molecule, wherein, said originally present T-cell epitopes are MHC class 
H hgands orpeptide sequences which show the ability to stimulate or bind T-cells 
via presentation on class II. 

20 5. A molecule of any of the claims 1-4, wherein said peptide sequences are selected 
from tihe group as depicted in FIGURE 1 . 

6. A molecule according to any of the claims 1-5, wherein 1 - 9 amino acid 
residues, preferably one amino acid residue in any of the originally present T-cell 

15 epitopes are altered; an accordingly specified molecule, wherein the alteration of 

the amino acid residues is substitution, addition or deletion of originally present 
amino acid(s) residue(s) by other amino acid residues) at specific position(s). 

7. A molecule of claim 6, wherein, if necessary, additionally further alteration 

o usually by substitution, addition or deletion of specific amino acid(s) is conducted 

to restore biological activity of said molecule. 



c -~* -~;+'ii/nR/9nn? 14:50 



Fmc-f nr . :fi9R P fllTR 



- I02103P-BZ-020611.doc 



11 JUN. 2002 14:55 



00 




51 727191 



NR. 365 



S. 9 



34 /3S 



8, A peptide molecule of any of the above-cited claims shari n g greater than 90% 
amino, acid identity with any of the peptide sequences of FIGURE 1 > 

9. A peptide molecule of claim 8 sharing greater than 80% amino acid identity with 
any of the peptide sequences of FIGURE 1. 



10. A peptide sequence according to any of the claims i -9, that are able to bind MHC 
class H. 



11. A bryodin 1 molecule according to any of the claims 1 - 10, wherein one or more 
of the flrmnp acid substitutions is conducted at a position corresponding to any of 



10 



15 



20 



25 



the amino acids specified within FIGURE 1. 

12. A molecule of claim 1 1 , wherein alteration is conducted at one or more residues 
from any or all of the string of contiguous residues of sequences: 

(a) ITTLYYYTASSAASALLVLIQSTAESA, 

(b) = ATEAAKFVFKDAKKK, 

(c) = ERLQTAAGKIRENIPLGLPALDSA, 

(d) = ITTLYYYTAS S AASALLVLIQSTAESA, 

(e) = TPLPSLATISLENNWSALSKQIQIAST, 

wherein said sequences are derived from the bryodin 1 wild-type sequence. 

13. A peptide molecule of claim 12, comprising 13-15 consecutive residues from any 
of sequences (a), <b), (c), (d) or (e). 

14. A peptide molecule of claim 12, comprising at least 9 consecutive residues from 
any of the sequences (a), (b) ? (c) (d) or (e), 

15. A peptide molecule according to claims 12 - 14, sharing greater than 90% amino 
acid identity with any of the peptide sequences derived from (a), (b) ? (d) or (e) 
above; 

16. A peptide molecule of claims 12 - 14, sharing greater than 80% amino acid 
identity with any of the peptide sequences derived from (a), (b), (c) (d) or (e). 



•rw; p nnq 
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17. A peptide sequences of any of the claims 12-16, that are able to bind MHC class 
It 

IS, A btyodin 1 molecule, wherein one or more of the amino acid substitutions is 
5 conducted at a position corresponding to any of the amino acids specified -within 

any of sequences (a), (b), (c) 9 (d) or (e) of claim 12. 

19. A bryodin 1 molecule of claim 18, wherein one or more of the amino acid 
substitutions is conducted at a position corresponding to any of the amino acids 
10 specified within sequence (b) above. 

SSTjA peptide sequence consisting of at least 9 consecutive amino acid residues of any 
of the sequences (a), (b), (c), (d) or (e) of claim 12 wad its use for the manufacture 
of bryodinlj a-trichosanthin, a-momorcharin or ]5-momorcharin having 
15 substantially no or less immunogenicity than any non-modified molecule and 

having the biological activity of a type 1 RIP when used in vivo. 

21 . A pharmaceutical composition comprising any of the peptides or modified 
peptides of any of the claims 1-20 having the activity of binding to MHC class H. 

20 

22. A DNA sequence or molecule which codes for any of said specified modified ' 
molecules as defined above and below. 

23. A pharmaceutical composition comprising a modified molecule according to any 
25 of the claims 1-20 having the biological activity of bryodin 1. 

24. A pharmaceutical composition of claim 23, optionally together with a 
phannaceutically acceptable carrier, diluent or excipient 

30 25- A method for manufacturing a modified molecule having the biological activity of 
bryodin 1 as defined in any of the claims of the above-cited claims comprising the 
following steps: Q) determining the amino acid sequence of the polypeptide or 
part thereof; (ii) identifying one or more potential T-cell epitopes .within the amino 
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acid sequence of the protein by any method including determination of the binding 
of the peptides to MHC molecules using in vitro or in siUco techniques or 
biological assays; (Hi) designing new sequence variants with one or more amino 
acids within the identified potential T-cell epitopes modified in such a way to 

5 substantially reduce or eliminate the activity of the T-cell epitope as determined 

by the binding of the peptides to MHC molecules using in vitro or in silico 
techniques or biological assays; (iv) constructing such sequence variants by 
recombinant DNA techniques and testing said variants in order to identify one or 
more variants with desirable properties; and (v) optionally repeating steps (ii) - 

10 (iv). 

26. A method of claim 25, wherein step (lii) is carried out by substitution, addition or 
deletion, of 1 - 9 amino acid residues in any of the originally present T-cell 
epitopes. 

15 

27. A method of cl aim 25 or 26, wherein the alteration is made with reference to an 
homologous protein sequence and / or in silico modeling techniques. 

28. A method of any of the claims 25 - 27, wherein step (ii) of above is carried out by 
20 the following steps: (a) selecting a region of the peptide having a known amino 

acid residue sequence; (b) sequentially sampling overlapping amino acid residue 
segments of predetermined uniform size and constituted by at least three amino 
acid residues from the selected region; (c) calculating MHC Class II molecule 
binding score for each said sampled segment by summing assigned values for 

25 each hydrophobic amino acid residue side chain present in said sampled amino 

acid residue segment; and (d) identifying at least one of said segments suitable for 
modification, based on the calculated MHC Class n molecule binding score for 
that segment, to change overall MHC Class II binding score for the peptide 
without substantially reducing therapeutic utility of the peptide; step (c) is 

30 preferably carried out by using a Bohm scoring function modified to include 12-6 

van der Waal's ligand-protein energy repulsive tenn and ligand conformational 
energy term by (1) providing a first data base of MHC Class n molecule models; 
(2) providing a second data base of allowed peptide backbones for said MHC 
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Class II molecule models; (3) selecting a model from said first data base; (4) 
selecting an allowed peptide backbone from said second data base; (5) identifying 



repeating steps (1) through (5) for each said model and each said backbone. 

29. A 13mer T-cell epitope peptide having a potential MHC class II binding activity 
and created from non-modified biyodin 1 > selected from the group as depicted in 
FIGURE 1 and its use for the manufacture of bryodin 1 having substantially no or 
less inimunogenicity than any non-modified molecule with the same biological 
activity when used in vivo. 

30. A peptide sequence consisting of at least 9 consecutive amino acid residues of a 
13mer T-cell epitope peptide as specified above and its use for the manufacture of 
biyodin having substantially no or less immunogenicity than any non-modified 
molecule and having the biological activity of biyodin 1 when used in vivo. 

31. A 13mer T-cell epitope peptide having a potential MHC class II binding activity 
and created from non-modified bryodin 1 , selected from any of the group of 
sequences (FIGURE 1) and its use for the manufacture of bryodin 1 having 
substantially no or less immunogenicity than any non-modified molecule and 
having the biological activity of a biyodin 1 molecule when, used in vivo. 

32. A peptide sequence consisting of at least 9 consecutive amino acid residues of a 
13mer T-cell epitope peptide as droved from any of the sequences in FIGURE 1 
and its use for the manufacture of bryodin 1 having substantially no or less 
immunogenicity than any non-modified molecule and having the biological 
activity of a bryodin molecule when used in vivo. 



amino acid residue side chains present in each sampled segment; (6) detennining 
the binding affinity value for all side chains present in each sampled segment; and 



30 
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Abstract 

The present invention relates to polypeptides to be administered especially to humans and 
5 in particular for therapeutic use. The polypeptides are modified polypeptides whereby the 
modification results in a reduced propensity for the polypeptide to elicit an immune 
response upon administration to the human subject The invention in particular relates to 
the modification of bryodin 1 to result in bryodin 1 proteins that are substantially non- 
immunogenic or less immunogenic than any non-modified counterpart when used in vivo. 
10 The invention relates furthermore to T-cell epitope peptides derived from said non- 
modified protein by means of which it is possible to create modified bryodin 1 variants 
with reduced immuno genicity . 
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VS FRLS GATTTS Y , 
YGVFIKNLREALP , 
KNLREALPYEKKV, 
RKVYNIPLLRS5I, 
IPLLRSSISGSGR, 
GRYTLLHLTNYAD , 
LHLTNYADETISV, 
ISVAVDVTNVYIM, 
TNVYIMGYLAGDV, 
YIMGYLAGDVSYF, 
GDVS YFFNEASAT , 
YFFNEASATEAAK, 
FVFKDAKKKVTLP , 
LPYSGNYERLQTA, 
GKIRENIPLGLPA, 
LGIiPALDSAITTL, 
TTLYYYTASSAAS , 
YYYTAS S AASALL , 
LLVLIQSTAESAR r 
ARYKFIEQQ I GKR < 
QQIGKRVDKTFLP, 
TFLPSLATXSLEN, 
I SLENNWSAL5KQ r 
KQI QI A5TNNGQF , 
SPWLIDGNISQRV, 
VLI DGNNQRVS IT , 
ARWT SN I ALLLN , 
IALIiLNRWNIAAIr 
NNIAAIGEDISHT r 
ISMTLIGFEHGLY, 



FRLSGATTTSYGV, 

GVFXKNLREALPY 

EALPYERKVYNIP 

KVYNIPLLRSSIS 

PLLRSSISGSGRY 

YTLLHLTNYADET 

TNYADETI3VAVD 

VAVDVTNVYIMGY 

NVYIMGYLAGDVS 

MGYLAGDVSYFFN 

VSYFFNEASATEA, 

AKFVFKDAKKKVT 

KKVTLPYSGNYER 

GNYERLQTAAGKI 

ENX PLGLPALDS A 

PALDSAITTLYYY 

TLYYYTA5SAASA 

5ALLVLIQSTAE5 

LVLIQSTAESARY 

YKFIBQQIGKRVD 

KRVDKTFLPSIAT 

P SLAT I S LENNWS 

NtfWSALSKQIQIA 

IQXASTOTIGQFES 

PWLIDGNNQRVS 

QRVSITOASARW 

RWTSNIALLLNR 

ALLLNRITOIAAIG 

AAIGBDISMTIiIG 

MTLXGFEHGLYGI 



TSYGVFIKNLREA, 
VFIKNLREALPYE r 
LPYERKVYNIPLL, 
YNXPLLRSSISGS, 
SSXSGSGRYTLLH, 
TLLHLTNYADETI r 
ETISVAVDVTNVY r 
VDVTT3VYIMGYLA r 
VYIMGYLAGDVSY, 
GYIAGDVSYFFNE, 
SYFFNEASATBAA, 
KFVFKDAKKKVTL, 
VTLP YS GNYERLQ , 
ERLQTAAGKIREN r 
I PLGLPALDS AI T r 
SAITTL YYYTAS S r 
L YYYTAS 5 AAS AL r 
ALLVLIQSTAE5A r 
VLIQSTAESARYK r 
KFIEQQI GKRVDK r 
KTFLF5LATISLE, 
ATISLENIWSALS r 
SALSKQIQXASTN, 
GQFESPWLIDGNr 
WLI DGNNQRVS I r 
VSITNA5ARWTS , 
SNIALLLNRNNIA, 
LLLNRtJN I AAI GE , 
EDISMTLIGFEHG r 



_ r 



■nnr n m a 
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FIGURE 2 



Peptide 

n># 


Bryodtni; ismerpepede 
sequence 






Peptide 
IDS 


Bryodfal; IStner peptide 
sequence 


Residue # 


1 


DVSFKLSGATTTSYG 


1 




44 


PRLDSAITTLYYYTA 


130 


2 


PRLSGATTTSVGWX 


4 




45 


D S A3 TUXYYYTA5SA 


133 


3 


S<5AT!TTS¥GVFIKWL 


7 




46 


ITTLYYYTASSAASA 


136 


4 


TTT5YGVFZKNLBEA 


10 




47 


LYYYTAS5AASALLV 


139 


5 


SYGVFIKBLREALPY 


13 




4B 


YTASSAASALLVLIQ 


1*2 


6 


VFI KULMSAIiPYERK 


16 




49 


SSA&SALLVLIQSTA 


145 


7 




19 




50 


ASftLLvtilQSTAESA 


148 


8 


REALEYEBKVYNIPL 


22 




51 


Li VLIQST AE SARYK 


151 


9 




25 




52 


LI05TAESAK3CKPIE 


154 


10 


ESKVYNIFLXRSSIS 


28 




53 


STAESAMTRFI EQQX 


157 


11 


VYNISLLRSSISGS G 


31 




54 


E5AEOTFIEQQIGKR 


150 


12 


iPIiRSSXSGSGRYT 


34 




55 


RYKFIEQQlGKRVDK 


1R3 


13 


LR5SISG5GRYTLLH 


37 




56 


PT200I GKRVDKTFTi 


1fifi 


14 


SIS SS GP.TXIiLHLTII 


40 




57 


QQ ICKBVDKTPLP 3L 


ICQ 


15 




43 




58 


GKRV OOTPLB SlATI 




16 


KYTLLHLTNYADETI 


46 




59 


VDK TFLPSLAT ISTiE 


17^ 


17 


LLHXiT NYADET I S VA 


49 




£0 


TFILPSTiAfT ^T.PWWtt 


1/b 


18 


XtftfYADSTISVAVpv 


52 




61 


PSLAT I 


Jt>1 


19 


YADET I S VAVD VTNV 


55 




62 


AT I st..enrws at.s pto 


10<» 


20 


EXISVAVDVlSVYm 


58 




63 


ST^SNMWSAT^ICQTftT 


f QT 

1BT 


21 


svav p v r i'H v incuts yi. 


51 




84 


WITHSAliSKQlOIAST 


I9U 


22 


VBVTFVYlWGYfcAGP 


64 




65 


Salskqxqiastnng 


193 


i 23 


TNVYIMGYIAGDVSY 


67 




65 


SKCIOXASTNNGOFP 




24 


Y utgyl&gp vs yffh 


70 




67 


IQI ASTNNGOPeS pv 


4 QQ 


25 


GYXAGDVSYFFNBA3 


73 




68 


ASTNNGOPESPWT.T 




26 


AGDVSYPFilEASATE 


76 




69 


NNGQFE5 PWLI DGN 




27 


V5YFFNEASATEAAK 


79 




70 


QFESPVVLIDGHKQR 


208 


28 


FFT3EA5ATEAAKFVF 


82 




71 


SPWLIDCNNQRVSI 


211 


29 




85 




72 


_ VLI DGUFQpysiW A 


214 


30 


ATEaAKFVFKDAKKK 


88 




73 


I TNAS AR 


217 


31 


AAKFVFKDAKKKVTL 


31 




74 


NQRVS ITNASARWT * 


220 


32 


PVFKDAKKKVTI.PY3 


94 




75 


V3ITKASARVVT 5WI 


223 


33 


KDAKKKVTIiPYSGNY 


97 




76 


THASARWTSNTAT.T. 


226 


34 


KKKVTLPY3GNYERL 


100 




77 


SARVVT9NIALLLNR. 


229 


35 


VTIiPYSGNYEIUjOTA 


103 I 




78 


WTSNIAIOjLB^NI?! 


232 


36 


PYSGNYERLQTAAGK 


106 




79 


SNIRX.LI.iaRNNIAAI 


235 


37 


GWYERLQTSAGKIRB 


109 




80 


ALLLNBNNIAAIGED 


Z38 


3B 


EFMTAAGKXR£»IP 


112 




81 


LNRNC3IAAXGHPZSM 


241 


39 


QTAAGKIRENI FLGL 


115 




82 


lTOIAAIGEDI5EEtt.I 


244 


40 


AGKIRENI FLGLFAL 


118 




83 


AAISEDISMTLICFE 


247 


41 


IBEPTIPLG1PAJ.PSA 


121 




84 


GSD 1 3MT I/I GFEHGL 


250 


42 


HIPfcGLPALDSAITT 


124 




86 


2 5MTLIG FEHGLYGI 


253 


43 


LGLPALDSAITTLYY 


1?7 
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FIGURE 3 



Epitope R1 (residues 46-66) 

1BRY R1TLLHLT1SIY2U3ETISVAVI1V 



1TCS 
1CF5 



;:A;I: :I: 
:FI*. s :3: :Y: :::::: : 



Epitope R2 (residues 86-102) 



1BRY 
1TCS 



ATEAAKFVFKDAKKK 

; : : : : :Y: : : : :MR: 



Epitope R3 (residues 112-135) 

EBLQTAAGKIRENIPLGLPaiiDSA 

:S 



1BRY 
1TCS 
1CFS 
1MOM 



:N: ::::H::: 

* « • « A « - • • A » 
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Epitope R4 (residues 136-162) 



1BRY 
1TCS 
1CF5 
1MOM 



ITTLYYYTAS SA&SALLVLI QS TAESA 



: : : :F: :N:N: : : : : :M 
: : : :F: :N:Q: :P: : : : 
:S: :LH:DST&: :G: : : 
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Epitope R5 (residues 178-204) 

1BRY TFLPSLAIISLEWMWSaLSKGIQIAST 

J_ J. L-O ••»»•■■■ 

1CF5 N:K: :: :I ::: {Q: : :FL-.QN 

1MOM DEV: :::::::: :S: :G: ::: : :L:QG 



